Collaborative annotation for person identification in TV shows
Authors
Abstract
This paper presents a collaborative annotation framework for person identification in TV shows. The web annotation frontend will be demonstrated during the Show and Tell session. All the annotation code is available on GitHub. The tool can also be used in a crowd-sourcing environment.
Related resources
Benchmarking multimedia technologies with the CAMOMILE platform: the case of Multimodal Person Discovery at MediaEval 2015
In this paper, we claim that the CAMOMILE collaborative annotation platform (developed in the framework of the eponymous CHIST-ERA project) eases the organization of multimedia technology benchmarks, automating most of the campaign technical workflow and enabling collaborative (hence faster and cheaper) annotation of the evaluation data. This is demonstrated through the successful organization ...
Human detection and character recognition in TV-style movies
The objective of this master thesis is the recognition of human characters in TV-style video sequences. In order to recognize the same person at different time instances in a video sequence, the outward appearance of the person has to be described and learned with an appropriate model. The diversity in which humans can appear makes the task of human detection and character recognition to a part...
Multimodal Person Discovery in Broadcast TV at MediaEval 2016
We describe the “Multimodal Person Discovery in Broadcast TV” task of the MediaEval 2016 benchmarking initiative. Participants are asked to return the names of people who can be both seen and heard in every shot of a collection of videos. The list of people is not known a priori, and their names have to be discovered in an unsupervised way from media content using text overlay or speech transcr...
Multimodal Person Discovery in Broadcast TV at MediaEval 2015
We describe the “Multimodal Person Discovery in Broadcast TV” task of the MediaEval 2015 benchmarking initiative. Participants were asked to return the names of people who can be both seen and heard in every shot of a collection of videos. The list of people was not known a priori, and their names had to be discovered in an unsupervised way from media content using text overlay or speech trans...
Improving speaker identification in TV-shows using person name detection in overlaid text and speech
This paper is dedicated to the use of auxiliary information to help a classical acoustic-based speaker identification system in the specific context of TV shows. The underlying assumption is that auxiliary information could help (1) to rerank the n-best speaker hypotheses provided by the acoustic-only speaker identification system, (2) to provide a confidence score to refine a rejectio...
Journal title:
Volume / Issue
Pages -
Publication date: 2015